|
|
Accession Number |
TCMCG075C24814 |
gbkey |
CDS |
Protein Id |
XP_007020138.2 |
Location |
join(17647292..17647519,17647634..17647806,17647921..17647993,17648899..17649033,17649153..17649363,17650620..17650768,17657752..17657850,17659368..17659437,17659560..17659695,17660127..17660221,17660304..17660412,17663561..17663726,17664497..17664658,17665627..17665801,17667670..17667833,17668554..17668598) |
Gene |
LOC18593050 |
GeneID |
18593050 |
Organism |
Theobroma cacao |
|
|
Length |
729aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007020076.2
|
Definition |
PREDICTED: DNA mismatch repair protein MLH1 isoform X1 [Theobroma cacao] |
CDS: ATGGATATAGAGGCACCAGGCGAAGCTAAGGAGCTACCGAAAATCCACCGTCTGGACGAGTCGGTCGTCAACCGAATTGCAGCCGGAGAGGTAATCCAACGCCCTGTCTCCGCCGTCAAAGAGCTGGTAGAGAATAGTTTAGACGCCTCCTCCACTTCCATAAGCGTCGTCGTCAAGGACGGTGGCCTTAAACTCATTCAAGTCTCCGACGATGGCCATGGCATTCGACATGAAGACTTGCCCATATTGTGTGAGAGGCATACGACGTCGAAGTTGTCGAAATACGAGGATTTGCAGTCAATAAAGTCGATGGGGTTTAGGGGGGAGGCGTTAGCGAGCATGACATATGTGGGTCACGTGACCGTCACCACCATTACCAAGGGCCAATTGCATGGCTACAGGGTGTCATATAGAGATGGCCTGATGGAGCATGAACCCAAGGCATGTGCTGCTGTCAAAGGAACCCAAATAATGGTTGAGAACCTGTTCTATAACATGATTGCTAGGAGGAAGACACTACAAAATTCTGCAGATGATTATACGAAAATTGTGGACTTGCTAAGCCGGTTTGCCATTCATCATATTGATGTGAGCTTTTCATGTAGAAAGCATGGAGCTGCTAGAGCAGATGTTCACTCGGTTGCGACATCATCAAGGCTTGATGCTATCAGATCTGTGTATGGTCTCTCGGTTGCTCGGAATCTGATAAAAATTGAAGCTTCAGATAATGATCCCTCTAGTTCAGTCTTTGAGATGGATGGCTTCATCTCCAACTCAAATTATGTTGTAAAGAAAACAACAATGGTGCTTTTTATCAATGATAGATTGGTTGAATGCACTGCTTTAAAGAGAGCTCTGGAAATTGTTTATTCTGCAACCTTACCAAAAGCATCAAAACCGTTCATATATATGTCAATTATATTGCCACCCGAGCATGTTGATGTGAACGTGCACCCAACAAAGAGAGAGGTAAGCCTTCTAAATCAGGAAGTTATTATTGAGAAGATACAGTCTGTGGTTGAATCAATGTTAAGAAACTCAAATGAGTCAAGGACATTTCAAGAACAGACAGTGGAATCCTCTCCATCTGTTCCCTCAATTACAAACAATGAGTCACACCTTAATCCCTCACCCTCTGGATCAAAATCACAGAAAGTTCCAGTGCACAAAATGGTAAGAACAGATTCATCAGATCCAGCTGGAAGGTTGCATGCCTACTTGTATAAGAAACCCCAGAACCACCTTGAAATGAATTCAAGCTTGACAGCAGTGAGGTCGTCAGTTAGACAAAGAAGGAACCTGAGGGAAACTGCAGATCTTACTAGCATTCAGGAGCTTATCAATGATATTGATAGCAAATGTCACTCTGGCCTGCTGGACATTGTGAGACAATGCACTTATGTTGGAATGGCAGATGATGTTTTTGCATTGCTTCAGCATAATACTCATCTATATCTTGCTAATGTGGTGAACTTAAGCAAAGAACTTATGTATCAGCAAGTTCTTCGTCGATTTGCTCATTTTAATGCTATCCAACTAAGCGAATCAGCACCCCTGCAAGAGTTACTTATGTTGGCGCTGAAGGAGGAGGAGTTGGACCTAGAATGCAATGAAAATGATGACCTCAAAATGAAGATTGCAGAAATGAATACACAGCTGCTTAAGCAAAAAGCTGAAATGCTAGAGGAGTATTTCTGCATTTTTATTGATTCAGATGGGAATCTGTCTAGGCTTCCAATACTACTTGACCAGTACACTCCAGACATGGATCGTGTTCCTGAATTCTTACTATGTTTGGGCAATGATGTTGATTGGGAAGATGAAAAAAATTGCTTCCAATCACTTGCAGCTGCTCTTGGGAATTTTTATGCCATGCATCCTCCTCTGTTGCCACATCCATCAGGTGAAGGATTGGAATTTTATAGAAAGAGAAAACATGGGAAGAATCCTCAAGATGTAGGAAAGTCTTCTTGTGACATTGGGGATGATATTGAAATAGAGGATGAATTTGAGCACAAACTACTTTCAGAAGCAGAGACTGCATGGGGCCAGCGTGAATGGTCAATCCAACATGTGTTGTTTCCATCCATGAGACTCTTTCTAAAGCCTCCAACGTCAATGGCTGTTAATGGAACCTTTGTCAGGGTGGCTTCACTGGAGAAGCTCTACAGGATTTTTGAGCGATGCTAA |
Protein: MDIEAPGEAKELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVSDDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGLMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSRFAIHHIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSVFEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILPPEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITNNESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVRSSVRQRRNLRETADLTSIQELINDIDSKCHSGLLDIVRQCTYVGMADDVFALLQHNTHLYLANVVNLSKELMYQQVLRRFAHFNAIQLSESAPLQELLMLALKEEELDLECNENDDLKMKIAEMNTQLLKQKAEMLEEYFCIFIDSDGNLSRLPILLDQYTPDMDRVPEFLLCLGNDVDWEDEKNCFQSLAAALGNFYAMHPPLLPHPSGEGLEFYRKRKHGKNPQDVGKSSCDIGDDIEIEDEFEHKLLSEAETAWGQREWSIQHVLFPSMRLFLKPPTSMAVNGTFVRVASLEKLYRIFERC |